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CIRCUIT SPLITTING IN ANALYSIS OF CIRCUITS AT TRANSISTOR LEVEL 



Background 

[0001] This application relates to analysis of integrated 
circuits . 

[0002] Circuits may be viewed as networks of nodes and circuit 
elements connected between nodes. As such^ circuits may be 
analyzed based on a nodal analysis where a nodal equation may 
be written for each node based on the conservation of charge 
at the node^ i.e.^ the total current entering the node is 
equal to the total current leaving the node (the Kirchoff^s 
second rule) . For a circuit with N nodes N equations for the 
N nodes can be expressed in terms of the properties of circuit 
elements such as resistance, capacitance, and inductance,, and 
in terms of the node voltages and currents- These N equations 
can be written into a matrix equation and are solved using 
various matrix approaches such as LU decompositions. 
[0003] Integrated circuits with transistors can be simulated 
using direct methods such as LU decompositions. One example, 
the Berkeley SPICE2 simulator and its variations use LU 
decompositions to solve for circuit equations for circuits 
with transistors. See, Nagal, ^'Spice2: A computer program to 
simulate semiconductor circuits,'' Tech. Rep. ERL M520, 
Electronics Research Laboratory Report, UC Berkeley (1975) . 
The direct simulation methods may become less effective and 
can reach their computational limits when the number of 
transistors and other elements in circuits approaches the 
capacity limit, e.g., around 50,000 transistors for some 
direct simulation methods. This is in part because the super 
linear complexity 0(n^'^) increases with the number of circuit 
nodes, n, and the amount of the extracted interconnect data 
for a large n can exceed the capacity of the software based on 
a direct simulation method. 
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Siammary 

[0004] This application describes, among others, a general 
operator splitting technique for analyzing large-scale 
integrated circuits to provide ensured convergence and 
improved simulation speed with high accuracy. The general 
operator splitting technique can be used to reduce the amount 
of non-zero fill-ins during the LU factorization. Various 
implementations of the splitting are described. 
[0005] In one implementation, a method for analyzing an 
integrated circuit includes splitting the integrated circuit 
into two sub circuits which are in or close to tree or forest 
structures and have a DC path to GND or VDD for each node; and 
applying a direct matrix inversion method to a circuit matrix 
equation for each of the two sub circuits to find solutions to 
the sub circuits- Accordingly, an article is also described 
to include at least one machine-readable medium that stores 
machine-executable instructions for the above method. The 
instructions cause a machine to split the integrated circuit 
into two sub circuits which are in or close to tree or forest 
strcutrues and have a DC path to GND or VDD for each node; and 
apply a direct matrix inversion method to a circuit matrix 
equation for each of the two sub circuits to find solutions to 
the sub circuits. 

[0006] In another implementation, a method for analyzing an 
integrated circuit is described to represent the integrated 
circuit with a graph. The graph is divided into two subgraphs 
that respectively represent two sub circuits of the integrated 
circuit. In the division process, a resistor is divided 
evenly between the two subgraphs while a transistor, a 
transistor device, a capacitor, or an inductor is duplicated 
in the two subgraphs. Each node of each subgraph has a DC 
path to GND or VDD, The solutions to the two subgraphs are 
found for analyzing the integrated circuit. Accordingly, an 
article is described to include at least one machine-readable 
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medium that stores machine-executable instructions for the 
above method. 

[0007] In yet another implementation an integrated circuit 

under analysis is represented by a graph. From one or more 
5 nodes connected to GND or VDD in the graphs a search for nodes 
is conducted in the graph. The edges of nodes from the search 
are divided to form two subgraphs that respectively represent 
two sub circuits of the integrated circuit. The two subgraphs 
are modified to ensure each node of each subgraph to have a DC 
10 path to GND or VDD. The two subgraphs are analyzed to find 
solution to the integrated circuit. Accordingly^ an article 
is described to include at least one machine-readable medium 
that stores machine-executable instructions for the above 
method . 

15 [0008] These and other implementations and their applications 
are described in greater detail in the attached drawings, the 
following detailed description, and the claims. 

Brief Description of Drawings 

20 [0009] FIGS. lA and IB show one example of a simple transistor 
circuit and its corresponding graph, respectively. 
[0010] FIGS. 2A, 2B and 2C illustrate one example of the 
splitting of the circuit in FIG. lA according to one 
implementation . 

25 [0011] FIG. 3 illustrates one implementation of the splitting 
process . 

[0012] FIGS- 4A, 4B, 4C, 4D, 4E, 4F and 4G show a step-by-step 
illustration of the application of the splitting in FIG. 3 for 
a simple 6x6 mesh transistor circuit. 
30 [0013] FIGS. 5A, 5B, 5C and 5D show simulation results using 
the present splitting technique. 



^3^ 
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Detailed Description 
[0014] The circuit splitting described here partitions the 

circuit equation for a complex integrated circuit into two 
less complicated subcircuits to allow for efficient 
5 computations of the circuit equations for the two subcircuits 
and to provide ensured convergence. The subcircuits have 
fewer elements than the original circuit. The splitting 
concept was introduced as a technique for solving partial 
differential equations. See,, e-g.f W. F. Ames,. "Numerical 
10 Methods for Partial Differential Equations,." 2nd edition,. New 
York Academic Press (1977) - This operator splitting can be 
illustrated with the following initial value problem (IVP) of 
a simple ordinary differential equation (ODE) , 



15 4r- = ^w (1) 

St 



where L is a linear or nonlinear operator and can be written 
as a linear sum of m suboperators Li, L2f—r B.nd of u, 

20 Lu^L{u^Lju^'- + L^u (2) 

Suppose ui,U2f . - . , Um are updating operators on u with respect 
to Lif Lzf • • • r Lm from time step n to time step n + 1,. the 
operator splitting approach has the form of: 



25 



(3) 



where each partial operation acts with all the terms of the 
original operator . 
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[0015] The circuit splitting described here uses the graph 
theory as the basis to generate a graph representation of an 
integrated circuit under analysis and then applies a set of 
partition rules to split the graph for the original circuit 
5 into two subgraphs respectively representing two smaller 
circuits. Circuits nodes are represented by vertices of 
graphs and circuit connections between different nodes are 
represented by edges in graphs. This use of the graph based 
modeling for circuits can be used to provide a generalized 

10 splitting algorithm and the generalization removes constraints 
of the geometry or locality of circuits under analysis . 
Notably, this approach is unconditionally stable and hence 
essentially eliminates the convergence issues in many direct 
methods for simulating large and complex circuits such as the 

15 LU decompositions - 

[0016] The following sections use a general circuit system 
to describe the circuit operator splitting method. It is 
assumed that the circuit includes resistors, capacitors, and 
inductors with mutual couplings. For linear circuits, the 

20 nodal analysis using Backward Euler Integration can be 
expressed as below: 
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25 where C, L, R, G are the matrices of capacitances, 

inductances, resistances, and conductances, respectively; the 
matrix A is an incidence matrix linking between the topology 
of capacitance nodes and inductance branches; the vectors V, 
I, and U describe the voltages of capacitance nodes, currents 

30 of inductance branches, and system inputs, respectively. The 

scalar h is the time step from time t to time (t+h) . Note 

that the four matrices, C, L, R, and G, are symmetric by 

^5^ 
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construction and are positive semidef inite because the circuit 
elements such as capacitances;, inductances;, resistances;, and 
conductances are non-active. In addition^ matrices C and L 
are assumed to be positive definite for a nondegenerated case. 



R = Ri + R2 and 
G = Gi + G2 - 

By construction^ the matrices Ri and Gi for i e {1^2} are 
15 symmetric and positive semidef inite . Following the circuit 

partition, the integration in Equation (4) is divided into two 
half steps and alternates the forward and backward 
integrations between the partitions as follows: 



5 



[0017] The circuit represented by Equation (4) can be 
partitioned into any two arbitrary subcircuits. The 
corresponding two partitions of matrices A, R, and G can be 



written as: 



10 



A =-Ai + A2, 







Vit+K) 
I(t + h) 



20 



(5) 
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10 



In the first half step^^ the forward integration is used for 
the subcircuit with matrices G2 and R2 - Then^. in the second 

half step;. the forward integration is used for the subcircuit 
with matrices Ai^ Gi and Ri. In both half steps the other 
partition is integrated by the backward implicit integration. 
[0018] If the two left-hand-side matrices in the two 
equations in the formulation (5) correspond to trees or forest 
structures^ a direct matrix inversion can be used to 
efficiently solve those two equations because there is no 
nonzero fill-ins and the computational cost is linearly 
proportional to the number of elements- Define notations Pl^ 
P2 and S as follows: 

4T' 



A ^ J 



P2 = 



A R2 . 
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15 



The two equations for the two half steps of the operator 
splitting formulation (5) can be simplified as: 



{P^ + S)X(t + h) = -{P,-S)X\t + -\ + Uit + h) 



(6) 



where X = 



V 
I 



[0019] The above operator splitting formulation allows for 
20 arbitrary splitting of a circuit. The present technique 
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applies a set of splitting rules to ensure more efficient 
processing of the two partitions and the original circuit. 
Such splitting rules are in part based on the recognition that 
the performance of direct methods such as LU decomposition can 
5 still beat iterative methods for small circuits with up to 
tens of thousands of nodes and direct methods become 
prohibitive for large circuits due to the order of ©(n"*"^) to 
0(n^'^) nonzero fill-ins^ where n is the number of nodes in 
circuits. In addition^ it is also known that the LU 

10 decomposition method does not create non-zero fill-ins for 

circuits in tree/forest structure if nodes elimination always 
selects from leaves of the graph. This elimination order can 
be captured by ordering algorithms based on minimum degrees 
(i.e., the number of neighboring nodes for a particular node). 

15 [0020] Following this observation, the present operator 

splitting algorithm splits a circuit into two partitions with 
structures close to tree or forest such that the number of 
non-zero fill-ins is minimized. The partition is carried out 
in two steps: graph representation of the circuit and 

20 application of the splitting rules to form two sub graphs . 
[0021] In the implementations described here, a circuit 
structure can be represented by an undirected graph. The 
edges in the undirected graph are divided into two sub-graphs 
using the graph theory. An undirected graph G = (V, E) is 

25 used to represent the circuit structure. 

[0022] In order to ensure the DC convergence of the two 
partitions, every node in both partitions has a DC path to GND 
or VDD. This is one of the rules of the present splitting 
algorithm. As an example, because each capacitor is a circuit 

30 break point in a DC path, there cannot a node between two 

capacitors. In addition, only resistive connections are split 
and resistors are divided into two partitions using graph 
theory algorithms. Capacitors and inductors are not divided 
so that each partition has a full version of each capacitor or 
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inductor. When solving each partition, the rest of circuit 
are modeled as equivalent current sources, following the 
operator splitting formulation (5) or (6) . In many digital 
circuits, transistors are often grouped as various gates. 
5 Taking into consideration the nonlinear property of transistor 
devices and gates, a single transistor or gate is not split 
into different partitions; instead each partition has a full- 
version of all transistor devices. Accordingly, transistor 
devices are treated like capacitors and inductors and thus are 

10 duplicated in each partition and are solved at every half time 
point. Hence, the present splitting algorithm divides each 
resistor branch into two partitions and each partition has a 
full-version of transistor devices, capacitors and inductors. 
Circuit nodes can be classified as super nodes and branch 

15 nodes in the graph for the circuit. A super node denotes the 
end point of resistors in large linear networks or a single 
gate. Branch node represents the end point of resistors on 
signal wires connecting gates in the circuit. The edge 
denotes the resistor branch in the circuit since only 

20 resistors are divided into partitions. End points of 

resisters on signal wires and large linear networks (e.g., a 
power network) are different. Only end points on signal wires 
are considered as branch nodes. With the transistor 
duplication strategy, each gate is represented by a super node 

25 in the graph and the detail inside each gate is invisible to 
the splitting algorithm. The resistors connection points in 
the circuit are represented by a branch node in the graph. 
Each edge eij between nodes vi and Vj in the graph represents a 
corresponding connection in the circuit. 

30 

[0023] FIG. lA shows a simple transistor circuit with three 
inputs and two outputs. FIG. IB shows the corresponding 
undirected graph representation of the circuit in FIG. lA. 
The five super nodes in solid dots represent five gates in the 
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circuit respectively and four branch nodes in hollow dots 
represent the resistive connections in the circuit. The 
outputs are omitted- FIG. 2A shows one possible splitting 
configuration of the graph in FIG. IB where dashed lines and 
5 solid lines denote two different partitions. FIGS. 2B shows 
the sub graph of the solid lines in FIG. 2A and FIG. 2C shows 
the sub graph of the dashed lines in FIG. 2A, 

[0024] The splitting algorithm is applied to divide the graph 
into two sub-graphs. Each sub-graph represents a sub-circuit 

10 and corresponds to a sub-matrix in circuit simulation process. 
' The number of non-zero fill-ins of the matrix LU decomposition 
is to be as small as possible during the partition so that the 
sum of the number of non-zero fill-ins of two sub-matrixes 
much less than the number of non-zero fill-ins of the original 

15 full matrix. To achieve this^ the rules for splitting the 
graph are designed to decrease the degree (the number of 
neighbors) of every node in both sub-graphs compared with the 
degree in the original undirected graph. In the matrix LU 
decomposition^ non-zero fill-ins are introduced among 

20 neighbors of a node when the node is eliminated and as such 

the non-zero fill-ins increase dramatically as the elimination 
progresses and eventually render the LU decomposition 
impractical for solving circuits with a large number of nodes. 
The present splitting algorithm minimizes the degree of every 

25 node in sub-graphs and thus reduces the niimber of non-zero 
fill-ins . 

[0025] In addition to the above rules on the DC paths, 
resistors, inductors, capacitors, and transistors, the 
following rules are applied for the splitting process. 
30 [0026] (1) Branch rule : the edges in one branch belong to the 
same partition. For example, signal wires connecting gates 
are assigned to one partition. In the undirected graph, one 
branch includes the edges connected by branch nodes. If one 
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branch is broken into two partitions, the broken branch node 
could cause undesired iterations during simulation. 
[0027] (2) Degree rule : the edges of a node whose degree is 
two belong to the same partition. The line structure wouldn't 
5 cause many non-zero fill-ins and it will be propitious to 
provide the DC path in the sub-graphs - 

[0028] (3) Loop rule : the loop is to be avoided in each sub- 
graph if possible. Loops in the sub-graphs can potentially 
introduce non-zero fill-ins and thus should be avoided or 
10 minimized if possible. 

[0029] (4) Balance rule : the edges for each node in the graph 
are to be divided into two sub-graphs. 

[0030] Application of the above splitting rules may render the 
partition of a circuit not optimized due to its structure 

15 limitation or the restriction of DC paths (no floating nodes 
is allowed at DC stage) . However, the number of overall non- 
zero fill-ins can still be greatly reduced for most circuits 
in comparison with the number of non-zero fill-ins due to the 
direct methods. In addition, the present splitting ensures 

20 the stability of the computation based on the splitting. 
[0031] FIG. 3 shows one implementation of the splitting 
algorithm. First, a circuit under analysis is represented by 
a corresponding undirected graph of super nodes and branch 
nodes where the VDD and GND nodes are identified. Next, a 

25 breadth first search (BFS) is performed beginning at the 
identified VDD and GND nodes to discover all nodes in the 
graph. Splitting rules are applied to divide edges of each 
node of the graph into two partitions. A DC path post 
processing is then performed to modify the partitions in order 

30 to ensure a DC path for each node in the two sub graphs. The 
final result is two sub-graphs, which correspond to two sub- 
circuits - 

[0032] In the BFS partition stage, the search may begin from 
VDD/GND nodes simultaneously to go through all the nodes in 
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the graph using BFS and to divide the edges of every node into 
two partitions according to the partition rules. In order to 
facilitate the post processing^ each of the nodes and edges is 
associated with a label to record the status of partition and 
5 DC path to VDD/GND. Based on the labeling information, the 
partition rules are applied to benefit the DC path available 
for all the nodes in sub-graphs. 

[0033] In the post processing stage, the partition for the 
edges without a DC path to VDD/GND is adjusted or modified. 

10 According to the labels, the partition status for all the 

edges and nodes is identified to indicate which partition each 
edge or node belongs to and whether each edge or node has a DC 
path to VDD/GND in that partition. From a global view of the 
BFS partition result, some parts of the graph would have DC 

15 paths while other parts of the graph may not have a DC path. 
Therefore^ there are boundary nodes between a part with DC 
paths and another adjacent part without a DC path. The post 
processing is to extend or '^grow'' an existing DC path from a 
boundary node for the edges and nodes without a DC path upon 

20 the partition. For example, if an edge does not have a DC 
path in the partition assigned to it while one of its 
connecting nodes has a DC path in another partition, the 
partition for this edge can be changed to provide a DC path. 
After the change in the partition, the labels for the 

25 corresponding nodes and edges are updated and the DC path may 
be propagated to some other edges without a DC path. In the 
final splitting result, a DC path is made available for all 
the nodes in both sub-graphs. 

[0034] FIGS. 4A through 4G illustrate an example of the above 

30 splitting algorithm for a simple circuit shown by a mesh graph 
in FIG. 4A. Node and edge labels are defined in the insert of 
FIG. 4B to include CONNECTED, UNCONNECTED, ZERO_UNCONNECTED, 
ONE_UNCONNECTED, ZERO__ONE__UNCONNECTED, ZERO_CONNECTED and 
ONE_C0NNECTED, where ^'ZERO'' and ^'ONE'' represent two sub-graphs 
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due to the splitting. The label's name describes the status of 
partition and DC path for the node. A node is ^'connected'' 
when it is connected to a GND or VDD. Hence, the label 
^'ZERO_UNCOISINECTED" for a node means that the node has an edge 
5 in the sub-graph ZERO without a DC path. Other labels are 
self explanatory. 

[0035] FIG- 4A shows the input undirected 6x6 mesh graph for a 
circuit where there are two VDD/GND nodes S(2,5) and S(5,2) 
and all nodes are assumed to be super nodes. FIGS. 4B-4F show 

10 the stepwise changes of splitting status for all the nodes and 
edges in the BFS partition stage. FIG- 4B shows the first 
step of the BFS process which starts from the VDD/GND nodes 
S(2,5) and S{5,2). The edges of nodes S(2,5) and S(5,2) are 
divided into two partitions evenly according to the balance 

15 rule. The labels for the edges and its neighbors also are 
updated at the end of the first step of the BFS partition. 
[0036] Next, the edges for the nodes S{1,5), S(2,6), S(3,5), 
S{2,4), S(4,2), S(5,3), S(6,2) andS(5,l) are divided. The 
edges of nodes S(l,6) and S(6,l) belong to the same partition 

20 according to the degree rule. The edge between nodes S(3,5) 
and S(3,6) are assigned partition to avoid loop according to 
the loop rule. The result of these operations is shown in 
FIG. 4C. 

[0037] FIG. 4D shows additional edges are divided into two 
25 partitions based on the rules. The edges far away from the 

input VDD/GND nodes 3(2,5} and S(5,2) are more likely to have 
no DC path. Six nodes remain unconnected at this time. FIG. 
4E shows that the edges of six unconnected nodes S{1,3), 
S(2,2), 3(3,1), 3(4,6), 3(5,5) and 3(6,4) in FIG. 4Dare 
30 processed and only two nodes 3(1,1) and 3(6,6) are left 

unconnected. The BFS partition continues to the remaining two 
nodes 3(1,1) and 3(6,6). FIG. 4F shows the final partition 
result of the BFS partition stage. There are 32 edges and 28 
edges in the two partitions, respectively, where 23 edges have 
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DC paths in their partition while 37 edges do not have DC 
paths . 

[0038] Next^ the post processing is performed to modify the 
partition to ensure the DC paths for all edges. FIG. 4G shows 
5 the final partition result after the post processing. Both 
sub-graphs in the final result have a tree/forest structure 
and there are 31 and 29 edges in the two partitions, 
respectively- The tree/forest structure greatly benefits the 
LU decomposition operations in the subsequent circuit 

10 simulation. 

[0039] The above splitting algorithm can be shown be 
unconditionally stable. For the analysis of the error 
propagation, the inputs in the operator splitting formulation 
(5) can be ignfored. The two half steps can be combiend and 

15 reduced to a recursive formula: 



where A = (P2 + S)"^ (Pi - S) (Pi + S)"'^(P2 - S) . In proof of the 
20 convergence, the norm as follows is used: 

The matrix S"^ is positive definite because the matrix S is 
positive definite and the inverse of a positive definite 
matrix remains to be positive definite. The following 
25 sections first state the theorem of the unconditional 

stability and then provide the proof of the statement by 
lemmas which follow the theorem. 

[0040] Theorem 3.1: The operator splitting formula (5) is 
stable independent of the step size h. 
30 [0041] Proof: Let p(A) = max ( | Xi (A) | ) , where X±{A) is the 1^^ 
eigenvalue of matrix A. The proposed operator splitting 
approach is stable if p(A)<= 1. 

[0042] From Lemma 3.4, we have the following 

-14- 
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Le t p ( a) = (/J - 5) (i? + sy' {p, - s){p, + sy 

From Lexnina 3.2 and 3.3, we can deduce: /7(A) = p(A^<l 
li&mna. 3.2: 

5 p[{p^+sy {p, -s){p,-^sY (p, -s)) = p({p,-s)ip,+sy' {p, -s){p, +sy) 

Proof: It can be derived that p(^AB) = p(^BA) if matrix A or B is 

nonsingular. Thus, the lemma can be proven by setting A = (P2 + 
S)^^ and B - (Pi - S) (Pi + S)"^ (P2 - S) . 

Lemma 3.3: Given a real matrix M, if real 
10 Zr then p{M)<y . 

The proof can be found in E. L. Wachspress and G. J. Habetler, 
"An alternating-direction-iimplicit iteration technique," J. 
Soc. Ind. and Appl. Math. 8, 403-424(1960). 

2 

lemzna 3.4; (i>-5)(P,+5')'';if ^Hj.. for i6{l,2} 

S 

15 and every real vector 2' - 

Biroof: {P^-S){Pi^SY^ z -IkC-i equivalent to 



{P-S){P,^SY' z\,^z\t^ where y^iP^^s^z 



We expand the inequality expression according to the 
20 definition of the norm. 



/{P^-^S')S-\P,-S)y</{P^^S')S-\P,-^S)y 



(8) 



We expand the product terms and cancel the common items 
on the two sides of the inequality. The expression is reduced 
25 to: 

-15- 
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which is true since is positive semidef inite for ze{l,2} . 

[0043] Though the general operator splitting approach is ^A- 
stable, the local truncation error still needs to be 
controlled below the error tolerance in order to ensure the 
accuracy. By estimating the local truncation error at each 
time point;, we can dynamically adjust the time step to control 
the local truncation error. 

[0044] Consider the system equation before the numerical 
integration^^ 



0' 


>(0' 




-G 






L 


_/(0. 




-A 


-R 





+U(t) 



(10) 



Let M = 



-G A"- 
-A -R 



and ignore the input vector U. 



Equation (10) can be simplified as: 



MX = NX 



(11) 



The exact analytic solution x with time step h can be derived 
as below: 



V- „M-im V 



h^(M-W hUu-^N) I ^s. 
2 6 ^ ^ 



(12) 



The general operator splitting approach can also be 
formulated as : 



(13) 
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where N = Ni -h N2r Ni represents the partition applied Backward 
Euler and Ni denotes the partition applied forward Euler 
integration method. 

[0045] The analytic solution of operator splitting approach 
is derived as below: 



(14) 



=[/ + /2M-'^^^-/^'M"^iVlM-^iV + 0(/2')]x„ (15) 

The local truncation error (LTE) is the difference of operator 
splitting solution and exact solution x: 



LTE = 



(16) 



The local truncation error at each time step should not exceed 
the error tolerance. If the high order terms of local 
truncation error are ignored, the time step when forward Euler 
integration is applied to partition corresponding to Ni can be 
estimated as: 



ErrorTolerance 






x„ 



(17) 



Similarly^ the time step when forward Euler integration is 

applied to partition corresponding to N2 is estimated as: 



ErrorTolerance 


IK' 




^"1 



(18) 



The new time step h is twice of the minimum time step of each 
partition: 

-17- 



wo 2006/132639 



PCT/US2005/020242 



A = 2min(/i,,^) 



(19) 



[0046] 



The above splitting algorithm was implemented in C 



programming language and tested against Berkeley SPICES using 
5 BSIM3 models for transistor devices. Convergence and accuracy 
are guaranteed. Examples were tested on a Linux machine with 
2.6 GHz CPU and 4 Gigabytes memory . 



current sinks ranging from Ilk nodes to 160k nodes were 

10 tested. Various transistor gates draw current from the power 
networks. Those power networks are approximately in mesh 
structures. The splitting algorithm results in very limited 
nonzero fill-ins and the linear runtime of the splitting 
algorithm was. The CPU runtime and speedup are given in Table 

15 I. One or two orders of magnitude speedup (8.1x to 58. 2x) 

against SPICE3 is obtained. The transient waveform circuits is 
given in FIG. 5A. The tests only replaced the LU 
decomposition procedure inside the SPICE3, other overhead such 
as device evaluation and dynamic time step control took more 

20 than 30% of the total runtime and therefore limited the 
overall speedup during our tests. 

The power and clock network tests were done for an RLC 
power ground network and a two-level H-tree clock. FIG. 5B 
shows the voltage drop at one node of the power network. 

25 Transient simulation of 10 ns is completed in 649.5 seconds^ 
which is 18.5 times faster than SPICES as shown in Table I. 

A large RC power network with 0 . 6 million nodes and an 
irregular structure was also tested where some nodes have 
thousands of neighbors. The switching activities that draw 
30 current from the power network were modeled as piecewise 

linear current waveform. Berkeley SPICE3 failed to execute 
because of the memory size and computation time problem. The 
operator splitting approach finished the transient analysis of 



[0047] 



A number of RLC power networks with nonlinear 
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10 ns in just 4083 seconds. FIG. 5C illustrates the voltage 
drop of a node on the power network. 

[0048] Two IK and lOK cells ASIC designs were also tested 
to demonstrate the splitting algorithm'' s ability of handling 
5 transistor dominated nonlinear circuits. The Ik cell circuit 
had 10,200 nodes and 6,500 transistors- The 10k cell circuit 
had 123,600 nodes and 69,000 transistors. It was assumed that 
ideal power and ground supply were provided in those RC 
examples. The splitting algorithm took 415.9 seconds for the 

10 IK cell circuit and 3954.7 seconds for the lOK cell circuit to 
finish 20-ns transient simulations- The speedup over SPICE3 
is 5 . Ix and 11. 2x for these two examples (Table I). Accurate 
waveform match was observed for those two examples. FIG. 5D 
shows the transient waveform of one gate output in the IK cell 

15 design. 

TABLE I 

TRANSIENT SIMULATION RUNTIME 



Examples 


circuitl 


circuit2 


circuits 


circuit4 


Power and Clock 
Network 


Large Power 
Network 


IK-cell 


lOK-cell 


#Nodes 


11,203 


41,321 


92,360 


160,657 


29,100 


615,446 


10,200 


123,600 


#Transistors 


74 


512 


1,108 


2,130 


720 


0 


6,500 ■ 


69, 000 


Simulation 
Period 


10ns 


10ns 


10ns 


10ns 


10ns 


10ns 


20ns 


20ns 


SPICES 
(sec) 


602.44 


8268.92 


39612.32 


N/A 


12015 


N/A 


2121 


44293 


Operator 
Splitting 
(sec) 


74.64 


305.38 


681.18 


1356.21 


649.5 


4083.7 


415.9 


3954.7 


Speedup 


8.1X 


27. Ix 


58. 2x 


N/A 


18. 5x 


N/A 


5.1x 


11. 2x 



[0049] In implementations, the above described techniques and 

20 their variations may be implemented as computer software 

instructions. Such software instructions may be stored in an 

article with one or more machine-readable storage media that 

are not connected to a computer, or stored in one or more 
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machine-readable storage devices connected to one or more 
computers either directly or via a communication link. In 
operation, the instructions are executed by, e.g., one or more 
computer processors, to cause the machine to perform the 
5 described functions and operations for circuit analysis. 

[0050] Only a few implementations are disclosed. However, it 
is understood that variations and enhancements may be made. 
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What is claimed is: 



Claims 
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1. A method for analyzing an integrated circuit, 
comprising: 

representing the integrated circuit with a graph; 

from one or more nodes connected to GND or VDD in the 
graph, conducting a search for nodes in the graph; 

dividing edges of nodes from the search to form two 
subgraphs that respectively represent two sub circuits of the 
integrated circuit; 

modifying the two subgraphs to ensure each node of each 
subgraph to have a DC path to GND or VDD; and 

analyzing the two subgraphs. 

2. The method as in claim 1, wherein a capacitor is not 
divided between the two subgraphs and is duplicated in the two 
subgraphs - 

3. The method as in claim 1, wherein an inductor is not 
divided between the two subgraphs and is duplicated in the two 
subgraphs . 

4. The method as in claim 1, wherein a transistor is not 
divided between the two subgraphs and is duplicated in the two 
subgraphs . 

5. The method as in claim 1, wherein a gate having a 
plurality of transistors is not divided between the two 
subgraphs and is duplicated in the two subgraphs. 

6. The method as in claim 1, wherein edges in one branch 
are assigned to the same subgraph. 



wo 2006/132639 PCT/US2005/020242 

7. The method as in claim 1, wherein edges of a node 
whose degree is two are assigned to the same subgraph. 



8. The method as in claim 1, wherein each subgraph is 
5 free of a loop. 

9. The method as in claim 1,. wherein a number of loops 
for nodes in each subgraph is minimized. 

10 10. The method as in claim 1;. wherein resistors of the 

integrated circuit are evenly divided between the two 
subgraphs . 

11. The method as in claim ly further comprising applying 
15 a direct matrix conversion method to analyze each of the two 

subgraphs . 

12 . The method as in claim 1^ wherein the search is a 
breadth first search. 

20 

13. A method for analyzing an integrated circuity 
comprising: 

splitting the integrated circuit into two sub circuits 
which are in or close to tree or forest strcutrues and have a 
25 DC path to GND or VDD for each node; and 

applying a direct matrix inversion method to a circuit 
matrix equation for each of the two sub circuits to find 
solutions to the sub circuits. 

30 14. The method as in claim 13, wherein the splitting is 

configured to reduce a total number of non-zero fill-ins for 
the matrices for the two sub circuits to be less than a total 
number of non-zero fill-ins for the matrix for the integrated 
circuit prior to the splitting. 
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15. A method for analyzing an integrated circuit, 
comprising : 

representing the integrated circuit with a graph; 
5 dividing the graph into two subgraphs that respectively 

represent two sub circuits of the integrated circuit, where a 
resistor is divided evenly between the two subgraphs while a 
transistor, a transistor device, a capacitor, or an inductor 
is duplicated in the two subgraphs, and wherein each node of 
10 each subgraph has a DC path to GND or VDD; and 

finding solutions to the two subgraphs for analyzing the 
integrated circuit. 

16. The method as in claim 15, wherein edges in one 
15 branch are assigned to the same subgraph. 

17. The method as in claim 15, wherein edges of a node 
whose degree is two are assigned to the same subgraph . 

20 18. The method as in claim 15, wherein each subgraph is 

free of a loop. 

19. The method as in claim 15, wherein a number of loops 
for nodes in each subgraph is minimized. 

25 

20. An article comprising at least one machine-readable 
medium that stores machine-executable instructions, the 
instructions causing a machine to: 

represent the integrated circuit with a graph; 
30 from one or more nodes connected to GND or VDD in the 

graph, conduct a search for nodes in the graphs- 
divide edges of nodes from the search to form two 

subgraphs that respectively represent two sub circuits of the 

integrated circuit; 
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modify the two subgraphs to ensure each node of each 
subgraph to have a DC path to GND or VDD; and 
analyze the two subgraphs . 

5 21. The article as in claim 20, wherein a capacitor is 

not divided between the two subgraphs and is duplicated in the 
two subgraphs . 

22- The article as in claim 20, wherein an inductor is 
10 not divided between the two subgraphs and is duplicated in the 
two subgraphs . 

23. The article as in claim 20, wherein a transistor is 
not divided between the two subgraphs and is duplicated in the 

15 two subgraphs . 

24. The article as in claim 20, wherein a gate having a 
plurality of transistors is not divided between the two 
subgraphs and is duplicated in the two subgraphs. 

20 

25. The article as in claim 20, wherein edges in one 
branch are assigned to the same subgraph. 

26. The article as in claim 20, wherein edges of a node 
25 whose degree is two are assigned to the same subgraph. 

27. The article as in claim 20, wherein each subgraph is 
free of a loop. 

30 28. The article as in claim 20, wherein a number of loops 

for nodes in each subgraph is minimized. 
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29, The article as in claim 20^ wherein resistors of the 
integrated circuit are evenly divided between the two 
subgraphs . 

5 30- The article as in claim 20, further comprising 

applying a direct matrix conversion method to analyze each of 
the two subgraphs . 

31. The article as in claim 20, wherein the search is a 
10 breadth first search. 
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FIG. 2 A 




FIG. 2C 
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FIG. 3 
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FIG, 4 A 
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FIG. 4B 
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FIG. 4G 
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FIG. 5D 
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